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Abstract. To predict precisely severe toxicity of irinotecan, 
we evaluated the association of UGTIA variants, haplotypes 
and the combination of UGTIA genotypes to severe toxicity 
of irinotecan. UGT1A1*6 (211G>A), UGT1A1*28 (TAf,>TAy), 
UGT1A1*60 (-3279T>G), UGT1A7 (387T>G), UGT1A7 
(622T>C), and UGTlA9*lb (-n8Tg>T,o also named *22) 
were genotyped in 123 patients with metastatic colorectal 
cancer who had received irinotecan-based chemotherapy. 
Among the 123 patients, 73 were enrolled in either of two 
phase II studies of the FOLFIRI (leucovorin, 5-fluorouracil 
and irinotecan) regimen; these patients constituted the training 
population, which was used to construct the predicting system. 
The other 50 patients constituted the validation population; 
these 50 patients either had participated in a phase II study of 
irinotecan/5'-deoxy-5-fluorouridine or were among consecu- 
tive patients who received FOLFIRI therapy. This prediction 
system used sequential forward floating selection based on 
statistical pattern recognition using UGTIA genotypes, gender 
and age. Several UGTIA genotypes [UGT1A1*6, UGT1A7 
(387T>G), UGT1A7 (622T>C) and UGTlA9*lb] were asso- 
ciated with the irinotecan toxicity. Among the haplotypes, 
haplotype-I (UGTIAT. -3279T, TA^,, 211G; UGT1A7: 387T, 
622T; UGT1A9: T,o) and haplotype-II (UGTIAT. -3279T, TA,„ 
211A; UGT1A7: 387G, 622C; UGT1A9: T^) were also associ- 
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ated with irinotecan toxicity. Furthermore, our new system for 
predicting the risk of irinotecan toxicity was 83.9% accurate 
with the training population and 72.1% accurate with the vali- 
dation population. Our novel prediction system using statistical 
pattern recognition depend on genotypes in UGTIA, age and 
gender; moreover, it showed high predictive performance even 
though the treatment regimens differed among the training 
and validation patients. 

Introduction 

Concurrent irinotecan and fluorinated-pyrimidine is a common 
first-line therapy for metastatic colorectal cancer (mCRC) 
(1-6). Although prolonged survival is associated with regi- 
mens involving irinotecan, severe neutropenia occurs in 
20-35% of mCRC cases treated with irinotecan regimens. 
Carboxylesterases catabolized irinotecan to 7-ethyl-lO-hydro- 
xycamptothecin (SN-38), which is a potent topoisomerase I 
inhibitor (7,8). SN-38 is then further catabolized by hepatic 
uridin diphosphate-glucuronosyltransferase (UGT) lA 
enzymes to an inactive SN-38 glucuronide (SN-38G) (9). Many 
mCRC patients with a genetic variant (UGTIAP'28) experi- 
ence severe irinotecan toxicity; UGT1A1*28 is a variation 
in the number (seven vs. six) of TA repeats in the promoter 
region of UGTlAl (10,11). Interestingly, the toxicity and 
tumor response of concurrent leucovorin, 5-fluorouracil, and 
irinotecan (FOLFIRI) reportedly also correlate with UGTIA 
variants (UGTlAl, UGT1A7 and UGT1A9) and haplotypes 
including these variants (12-18). There are differences between 
Caucasian and Asian populations in frequencies of UGTIA 
variants, and UGT1A1*6 reportedly associates strongly with 
severe neutropenia especially among Asian patients (12,17). 

To predict the risk of irinotecan toxicity for individual 
patients, it is important that determining the relative contri- 
butions of UGTIA variants other than UGT1A1*28 and 
UGT1A1*6 is important to the development of any system 
designed to predict irinotecan toxicity for individual patients 
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because patients without UGT1A1*28 or *6 do experience 
severe irinotecan toxicity. Several studies have examined asso- 
ciations between irinotecan toxicity and UGTIA haplotypes in 
addition to each genotype of UGTIA (17-19). However, deter- 
mining the haplotype or diplotype for each patient is difficult; 
moreover, most haplotypes and diplotypes are too rare to 
constitute a group large enough for meaningful statistical 
analysis. Moreover, gender and age of patients each reportedly 
have an impact on irinotecan toxicity (20-22). Hence, these 
factors should be also taken into consideration when devel- 
oping a system designed to predict irinotecan toxicity. 

The aim of this study was to evaluate whether the combi- 
nations of UGTIA genotypes, but not haplotypes, together 
with patient characteristics might be useful in predicting the 
risk to patients with mCRC treated of irinotecan-containing 
regimens. Here, we investigated the genotypes of 123 patients 
at six loci: UGT1A1*6 (211G>A, rs4148323), UGT1A1*28 
(TA6>TA7, rs8175347), UGT1A1*60 (-3279T>G, rs4124874), 
UGT1A7 (387T>G, rsl7868323), UGT1A7 (622T>C, 
rsll692021), and UGTlA9*lh (-118T<,>T,o, rs35426722, also 
called UGT1A9*22) (23). Next, we evaluated the contribu- 
tion of each UGTIA genotype, haplotype, and diplotype to 
the risk of irinotecan toxicity. Furthermore, we developed a 
new system for predicting the risk that a patient will experi- 
ence irinotecan toxicity; this system uses sequential forward 
floating selection (SFFS) algorithm based on statistical pattern 
recognition to select the combinations of UGTIA genotypes, 
gender and age. SFFS is a sequential search method character- 
ized by a dynamically changing number of features included 
or eliminated at each step of an individual analysis (24). This is 
the first study conducted to assess the role of the combination 
of genotypes at six polymorphic sites in UGTIA and clinical 
features constructed by SFFS on the risk of irinotecan toxicity. 

Materials and methods 

Patients. In this study, 123 mCRC patients were examined 
for association between UGTIA genotypes and irinotecan 
toxicity (Table 1). This study was performed as an ancillary 
investigation; data collected from three prospective studies 
[FLIGHTl (5), FLIGHT2 (5) and FRUTIRI (6)] and from 
consecutive patients who received FOLFIRI at the Department 
of Digestive Surgery and Surgical Oncology, Yamaguchi 
University Graduate School of Medicine, Japan. Each partici- 
pant received irinotecan at the dose of 150 mg/m^, which has 
been approved in Japan. 

FLIGHTl (UMIN000002388) and FLIGHT2 
(UMIN000002476) were phase II studies of first line and 
second line chemotherapy, respectively, for mCRC. Study 
designs and key eligibility and exclusion criteria have been 
described in detail (5,25,26). Briefly, each regimen consisted 
of irinotecan on day I H-400 mg/m- fluorouracil bolus followed 
by 2,400 mg/m^ fluorouracil continuous infusion during 
46 h H- 200 mg/m^ leucovorin on day 1 every 2 weeks. Of all 
patients from the FLIGHTl and FLIGHT2 studies, 38 and 35, 
respectively, participated in this ancillary investigation and use; 
these 73 patients constituted the training population. FLIGHTl 
or FLIGHT2 patients homozygous for UGT1A1*28 were 
excluded from the training population because these patients 
received a lower starting dose of irinotecan (100 mg/m^) (5). 



The validation population comprised 50 patients from 
two different study groups: 22 patients who participated 
in FRUTIRI (UMIN000005011), a phase H study of a 
combination therapy comprised irinotecan and 5'-deoxy-5-flu- 
orouridine (5'-DFUR) (6) and 28 consecutive patients who 
underwent second-line FOLFILI treatment between October, 
2008 and July, 2012 in the Department of Digestive Surgery 
and Surgical Oncology, Yamaguchi University Graduate 
School of Medicine, Japan. Detail treatment regimen tested 
in FRUTIRI was described previously (6). Briefly, irinotecan 
was administered every two weeks, and 400 mg 5'-DFUR was 
administered every week orally twice a day on five consecu- 
tive days that were followed by a weekly 2-day washout. The 
28 consecutive patients undergoing FOLFIRI treatment were 
following the protocol used in FLIGHT2 (26). In a validation 
population, patients with UGT1A1*28 homozygous were not 
found in the FRUTIRI study (n=28). Additionally, patients 
heterozygous for UGT1A1*28 (n=6) were excluded from the 
FRUTIRI study because these patients received lower starting 
dose of irinotecan 70 mg/m^. Among the 28 consecutive 
patients who received second-line FOLFILI therapy, homozy- 
gous for UGT1A1*6 or *28 and those compound heterozygous 
for UGT1A1*6 and UGT1A1*28 been excluded from this 
ancillary study. The training (n=73) and validation (n=50) 
populations did not differ significantly with regard to the 
distribution of any clinical feature or genotype that is listed in 
Table I except for the distributions of the UGT1A7 (387T>G) 
and UGTlA9*lh alleles (data not shown). 

In this study, we defined patients who exhibited hemato- 
logic toxicity greater than grade 3 during the entire course 
of therapy as experiencing irinotecan toxicity. The study 
protocols were approved by the Institutional Review Board at 
Yamaguchi University Graduate School of Medicine, and were 
carried out in accordance with the Helsinki declaration on 
experimentation on human subjects. Each patient gave written, 
informed consent for their participation in this study. 

Genotyping of UGTIA and haplotype construction. A conven- 
tional sodium iodide (Nal) method was used to extract genomic 
DNA from peripheral blood samples (27). The number of TA 
repeats in the UGTlAl promoter region was determined by 
the fragment size analysis followed by direct sequencing as 
described previously (4). The TaqMan technique with a hydro- 
lysis probe was used to determine the UGT1A1*6 genotype 
as described previously (28); similarly, hydrolysis probes were 
used to determine the genotypes at UGT1A1*60; a direct 
sequencing method was also used to determine the genotypes 
at UGT1A7 (387T>G and 622T>C) and UGTlA9*lb. 

Each nucleotide variant was evaluated to determine 
whether it was in Hardy-Weinberg equilibrium; Haploview 
4.2 software was used to perform the linkage disequilibrium 
(LD) and case-control haplotype analyses (29). Lewontin's 
coefficient D' and correlation coefficient were calculated as 
measures of LD. 

Construction of toxicity prediction system by genotype combi- 
nations. To predict severe toxicities of irinotecan, the age, the 
gender and a comprehensive 6-site UGTIA genotype were 
determined for each of the 73 patients in the training popu- 
lation. SFFS, a method of statistical pattern recognition, was 
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Table I. Characteristics of the patients. 



Sub-population (treatment regimen) 



Clinical features Total FLIGHTP FLIGHT2^ FRUTIRI" 2nd-hne FOLFILr 

and genotypes (n=123) (n=38) (n=35) (n=22) (n=28) 



Toxicity of irinotecan 



No 72 20 

Yes 51 18 

Gender 

Male 78 24 

Female 45 14 

Age 

<60 50 14 

>60 73 24 

UGT1A1*6 

-I- 84 25 

-1*6 36 12 

*6I*6 3 1 

UGT1A1*28 

-I- 103 32 

-1*28 20 6 

*28l*28 0 0' 

UGT1A1*60 

-I- 71 19 

-1*60 46 17 

*60l*60 6 2 

UGT1A7 

387T/T 41 13 

387T/G 69 18 

387G/G 13 7 

UGT1A7 

387T/T 70 21 

387T/G 48 15 

387G/G 5 2 

UGTlA9*lb 

*lbl*lb 43 14 

-l*lb 67 17 

-/- 13 7 



19 16 17 NS" 

16 6 11 

24 17 13 NS" 

11 5 15 

14 9 13 NS" 

21 13 15 

23 15 21 NS" 

11 6 7 

1 1 O'^ 

27 22 22 NS" 

8 0" 6 

Qa 0" 0"^ 

21 15 16 NS" 

12 6 11 

2 1 1 

12 8 8 NS" 

18 13 20 
5 1 0 

19 14 16 NS" 

13 8 12 

3 0 0 

12 9 8 NS" 

18 12 20 

5 1 0 



The following patients were not enrolled in this study as described in Materials and methods. "Patients bearing UGT1A1*28 homozygous 
were excluded from the FLIGHTl and FLIGHT2 studies. ''Homozygous and heterozygous of UGT1A1*28 were not enrolled in the FRUTIRI 
study. 'Homozygous of UGTlAl *6 and *28 and compound heterozygous of UGTlAl *6 and *28 were not included in the consecutive patients 
received second-hne FOLFILI therapy. ""NS, not significant among 4 groups by Fisher's exact test. 



then used to determine the optimal genotype combinations for 
predicting the risk of irinotecan toxicity. The statistical pattern 
recognition, SFFS, identified the genotype combinations with 
the 'maximum number of cases' and 'maximum prediction 
rate' to maximize overall diagnostic accuracy (24). Briefly, 
the algorithm of the SFFS used in this study was as follows: 
i) Suppose that at stage k we have a set of X,, ...,Xi^ of sizes 1 to 
k, respectively, ii) Let the corresponding values of the feature 



selection criteria be Jj to where 7, = 7(X,), for the feature 
selection criterion J(). iii) Let the total set of features be X. 
Then at the kth stage of the SFFS procedure follow these steps: 
Step 1, select the feature Xj from X-X/, that increases the value of 
J to the greatest degree and add it to the current set: = Xj, 
+ Xj. Step 2, find the feature x,. in the current set X^J^^^ that 
reduces the value of / the least; if this feature is the same as Xj 
then set 7(4+ 1) = J{X^,.^^^\ increment k\ go to step 1; otherwise 
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Table II. Minor allele frequency and Hardy- Weinberg equilib- 
rium in 123 patients. 



103 patients" 123 patients'" 





MAF 


HWp 


MAF 


HWp 


UGT1A1*6 [211 (G>A)] 


0.18 


1.00 


0.17 


1.00 


UGT1A1*28 [(TA)(,>(TA)7] 


0.12 


0.80 


0.08 


0.86 


UGT1A1*60 [-3279 (T>G)] 


0.27 


0.99 


0.24 


0.92 


UGT1A7 [387 (T>G)] 


0.42 


1.00 


0.39 


0.07 


UGT1A7 [622 (T>C)] 


0.27 


0.61 


0.24 


0.54 


UGTlA9*lb [-118 (T5>T,o)] 


0.41 


0.84 


0.38 


0.13 



Tatients enrolled in the FLIGHT 1, FLIGHT2 and FRUTIRI studies 
(patients received lower starting dose of irinotecan were not excluded). 
''Patients subjected to case-control study (patients received lower 
starting dose of irinotecan were excluded). MAF, minor allele fre- 
quency. HWp, p- value of Hardy-Weinberg equilibrium. 



A The Lewontin's coefficient D' B The correlation coefficient 




^^^^^^ :d n D ID D D 




Figure 1. Pairwise linkage disequilibrium relationships between the 
UGTIA variants. (A) The Lewontin's coefficient D' and (B) the correlation 
coefficient r- are represented as values and colors [in panel A, log of the 
odds (LOD) >2 shades of pink/red, LOD <2 and D'=l is blue, and LOD <2 
and D' <1 is white. In panel B, r-<0.01 is white, 0.01<r-<0.95 is shades of 
grey, and r^>;0.95 is black] in each box. 



remove it from the set to from X\ = X^^^^^ - x^. Step 3, continue 
removing features from the set X\ to form reduced sets X\^. _ ,) 
while J(X\^_ ,)) > J^l._ ^^;k = k - 1; until k = 2; then continue with 
step 1 . The algorithm is initialized by setting k = Q and Xq = 0. 

Statistical analysis. Fisher's exact test was used to assess the 
relationship between toxicity and each UGTIA variant. The 
Cochran-Armitage trend test was used to examine the linearity 
of the relationship between UGTIA genotypes and irinotecan 
toxicity. SPSS Statics 17.0 software (IBM, Tokyo, Japan) and 
R version 2.13.0 software were used to perform the calcula- 
tions (30). p<0.05 was considered statistically significant. 

Results 

UGTIA allele and haplotype frequencies. The minor 
allele frequencies (MAF) of each UGTIA allele among the 
103 patients without genetic bias; all patients regardless of the 
starting dose of irinotecan enrolled in FLIGHTl, FLIGHT2, 
and FRUTIRI studies, and 123 patients received a starting 
dose of 150 mg/m^ for case-control study participating in 
this study are listed in Table II. In this study, the MAFs of 
UGT1A1*28 and UGT1A1*6 were approximately 0.117 and 
0.184, respectively. The MAF for each other UGTIA SNP 
examined in this study was greater than 0.20. Among all 
patients, the Hardy-Weinberg equilibrium p-value for each 
locus examined in this study was higher than 0.05. LD analysis 
with 103 patients showed that high LD (r^>0.9) was evident 
between UGT1A7 (387T>G) and UGTlA9*lb (Fig. 1). We 
found 12 UGTIA haplotypes (Hp-I to Hp-XIT) using 6 loci 
in 103 patients: UGT1A1*6, *28, *60, UGT1A7 (387T>G), 
UGT1A7 {622T>C), and UGTlA9*lb (Table III). Three 
common haplotypes (Hp-I, Hp-II and Hp-III) accounted for 
82.5% of all haplotypes identified in this study. 

Associations between UGTIA genotypeslhaplotypes and irino- 
tecan toxicity. We examined associations between individual 



UGTIA genotypes or haplotypes and severe irinotecan toxicity 
among 123 patients with mCRC who receive chemotherapy that 
included irinotecan (Table IV). Each of four UGTIA genotypes 
[UGT1A1*6, UGT1A7 (387T>GX UGT1A7 (622T>C) and 
UGTlA9*lb] showed a significant association to irinotecan 
toxicity and linear trend (p<0.05). Similarly, two haplotypes 
(Hp-I and Hp-IT) each showed a significant association to and 
linear trend with irinotecan toxicity (p<0.05). Among two 
patients received a starting dose of 100 mg/m^ irinotecan, diplo- 
type of Hp-IV/V did not show toxicity and diplotype of Hp-V/V 
showed toxicity. Six patients excluded from FRUTIRI study did 
not show toxicity of irinotecan (a starting dose of 70 mg/m^; 
UGTIA diplotypes of Hp-I/V, II/IV and III/IV were found in 2, 
3 and 1 patients). Regarding non-hematological toxicities, only 
5 patients developed grade 3 diarrhea (UGTIA diplotype of these 
5 patients consists of 4 Hp-II II and 1 Hp-IIIXII). 

Performances of the toxicity prediction system by genotype 
combination. To construct a system for predicting the risk of 
severe irinotecan toxicity, genetic data from 73 patients that 
constituted the training population were analyzed exhaus- 
tively; specifically, SFFS was used to assess gender, age and the 
individual genotypes at six polymorphic UGTIA sites (Fig. 2). 
In addition to the three possible genotypes (wild-type homozy- 
gous, heterozygous, variant homozygous), a fourth option for 
each site (designated 'unspecified genotype') was included into 
the algorithm. Similarly, patient gender (male, female, regard- 
less of gender) and age (<60, >60 years old, regardless of age) 
were assessed. The cutoff value for age (60 years) was deter- 
mined by Youden index obtained by the receiver operating 
characteristic (ROC) curve analysis with the training popu- 
lation. Among possible combinations (4^ x 3^ - 1 = 36,863), 
the following cases were excluded: cases not found, single 
cases, and cases that represented positive or negative predic- 
tive values <80%. In order to optimize the combinations. 
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Table III. Haplotype frequency. 



UGTIA alleles 

Allele frequencies 

UGT1A9 UGT1A7 UGTlAl 



Haplotypes 


*lb 


387T>G 


622T>C 


*60 


*28 


*6 


(n=103)'' 


(n=123r 


Hp-I 


T,o 


T 


T 


T 


TA, 


G 


0.524 


0.573 


Hp-II 


T/ 


CP 


C 


T 


TAg 


A" 


0.170 


0.159 


Hp-III 


T- a 




T 


G'' 


TAg 


G 


0.131 


0.134 


Hp-IV 


T/ 






G'' 


TA/ 


G 


0.063 


0.041 


Hp-V 


T,o 


T 


T 


G" 


TA/ 


G 


0.044 


0.028 


Hp-VI 


T- a 
"-9 


G^ 


C 


T 


TAg 


G 


0.015 


0.016 


Hp-VII 


T- a 
•-9 


G^ 




G'' 


TA, 


G 


0.015 


0.012 


Hp-VIII 


T/ 


G^ 


T 


G" 


TA/ 


G 


0.010 


0.012 


Hp-IX 


T,o 


T 


T 


G^ 


TA, 


G 


0.010 


0.008 


Hp-X 


T,o 


G^ 




T 


TA, 


A" 


0.010 


0.008 


Hp-XI 


T/ 


G^ 


T 


T 


TA, 


G 


0.005 


0.004 


Hp-XII 


T,o 


T 


T 


T 


TA, 


A" 


0.005 


0.004 



"Association of thie alleles with toxicity of irinotecan. ''Patients enrolled in the FLIGHT 1 , FL1GHT2 and HiUTlRl studies (patients received lower 
starting dose of irinotecan were not excluded). "Patients subjected to the case-control study (patients received lower starting dose of irinotecan 
were excluded). 



A Combinations for cases with toxicity (Positive) 



L^GT genotypes Clinical features 





1A1'6 


1Ar28 


1A1*60 


1A7 


1A7 


1A9nb Sex 


Age 


G3, G4/Total 


Name 


211(G>A) 




.3279(T>G) 


3S7(T>G) 


622(T>CJ 


■11S[T)9.n> 




n (%) 


P-l 






*« 


** 








4/4 (100.0) 


P-ll 


f G/A 


** 


*« 


** 


** 


Tg/Tio Male 




4/4 (100.0) 


P-tll 




** 


*« 


** 




** «* 


** 


2/2 (100.0) 


P-IV 






** 




T/C 


** «• 


** 


2/2 (100.0) 


P-V 




TAe/TAs 


t/g 


** 


T/C 


** « 




2/2 (100.0) 


P-VI 




Air 


** ** 








9/10 (90,0) 


P-VII 




** 


*« ** 


T/C 


** «* 




14/17 (82,4) 


P-VIII 




ft* 


** 




T/C 






4/5 (80.0) 


















19/22 (86.4) 


B Combinations for cases without toxicity (Negative) 












i/GT genotypes 




Clinical features 






1A1'6 


iAi'2a 


1A1*60 


1A7 


1A7 


lA9'lb Sex 


Age 


G0-G2/Total 


Name 








3B7(T>G) 


e22(T>C) 






n (%) 


N-l 






** 




** 


Male 




6/6 (100.0) 


N-ll 




TAe/TAr 












5/5 (100.0) 


N-lll 


Kg/g 


TA/TAg 


T/G 


** 


** 






373 (100.0) 


N-rv 




TA^A, 


T/G 




** 


Mate 




3/3 (100.0) 


N-V 


Hg/g 


** 






M 




** 


2/2 (100.0) 


N-VI 










** 


2/2 (100.0) 


N-VII 






T/T 


** 


Male 


** 


12/15 (80.0) 


N-Vltl 


*« 


>A 


T/T 


** 




Ti(/Tio Male 




12/15 (80.0) 


N-IX 


•* 


** 


in 


** 


** 






12/15 (80.0) 


N-X 










8/10 (80.0) 



33/40 (82.5) 

Figure 2. The UGTIA genotype combinations that predict the presence or absence of severe irinotecan toxicity based on statistical pattern recognition. (A) 
A total of 8 combinations (P-I to P-VIII) for positive prediction of the toxicity and (B) 10 combinations (N-I to N-X) for negative prediction are presented. 
(A) The 8 combinations (P-I to P-VIII) that predict the presence of irinotecan toxicity are shown. (B) The 10 combinations (N-I to N-X) that predict the 
absence of irinotecan toxicity are shown. Eight factors-patient age, patient gender and genotypes at six UGTIA sites [UGT1A1*6, *28, *60, 1A7 {387T>G), 
1A7 {622T>C) and lA9*lh] were used with sequential floating forward selection (SFFS) for statistical pattern recognition as described in Materials and 
methods. Homozygosity for alleles associated with irinotecan toxicity, heterozygosity and homozygosity for alleles not associated with irinotecan toxicity 
are indicated by red, blue and green cells, respectively. "The un-specified categories (regardless of genotypes, gender or age). 
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Table IV. Associations between UGTIA genotypes/haplotypes and irinotecan toxicity. 

Toxicity 



p-value 



Yes 



No 



(% of yes) 



Fisher's exact 



CA trend 



Genotypes 
UGT1A1*6 



UGT1A1*28 



UGT1A1*60 



UGT1A7 
(387T>G) 

UGT1A7 
(622T>C) 

UGTlA9*lb 



Haplotypes 
Hp-I 



Hp-II 
Hp-III 



Clinical features 
Gender 

Age 



-/- 
-/*6 
*6/*6 

-/- 
-11*28 
1*28/1*28 
-/- 
-11*60 
1*60/1*60 
387T/T 
387T/G 
387G/G 
622T/T 
622T/C 
622C/C 
9*lb/9*lb 
-/9*lb 
-/- 

0" 
V 
2" 
0" 
P 
2" 
0" 
V 
2" 

Male 
Female 
<60 
>60 



27 
21 
3 
40 
11 

27 
20 
4 
9 
34 
8 
18 
31 
2 
9 
34 
8 

12 
32 

7 
27 
22 

2 
38 
12 

1 

31 
20 
15 
36 



57 
15 

0 
63 

9 

44 
26 

2 
32 
35 

5 
52 
17 

3 
34 
33 

5 

6 
37 
29 
59 
13 

0 
53 
19 

0 

47 
25 
35 
37 



(32.1) 
(58.3) 
(100.0) 
(38.8) 
(55.0) 

(38.0) 
(43.5) 
(66.7) 
(22.0) 
(49.3) 
(61.5) 
(25.7) 
(64.6) 
(40.0) 
(20.9) 
(50.7) 
(61.5) 

(66.7) 
(46.4) 
(19.4) 
(31.4) 
(62.9) 

(100.0) 
(41.8) 
(38.7) 

(100.0) 

(39.7) 
(44.4) 
(30.0) 
(49.3) 



0.002 
0.218 
0.349 
0.005 
<0.001 
0.003 

0.002 
0.001 
0.517 

0.705 
0.027 



0.001 



0.219 



0.002 



<0.001 



0.001 



<0.001 



<0.001 



0.900 



"Number of alleles carried by the patient. CA, Cochran-Armitage trend test. 



categorization according to predictive value and exclusion of 
redundant combinations in each category were performed. As 
a result, 8 combinations (P-1 to P-VIII, Fig. lA) appeared to 
predict an increased risk of toxicity, and 10 combinations (N-I 
to N-X, Fig. IB) appeared to predict a lack of toxicity. 

The system for predicting irinotecan toxicity based on 
combinations of 8 factors (6 genotypes, gender and age) was 
generated using data from of all 73 patients in the training 
population. The system was then applied to data from 84.9 and 
86.0% of the patients in the training and validation popula- 
tions, respectively (Table V). This prediction system showed 
83.9% accuracy (positive predictive value, 86.4%; negative 
predictive value, 82.5%) for the training population (n=62) 
and 72.1% accuracy (positive predictive value, 70.0%; negative 



predictive value, 72.7%) for the validation population (n=43). 
When patients who were not applied to the combinations were 
included, the performance of the system was 71.2% accuracy 
(sensitivity, 55.9%; specificity, 84.6%) in training population 
(n=73) and 62.0% accuracy (sensitivity, 41.2%; specificity, 
72.7%) in validation population (n=50). Odds ratios of positive 
prediction for irinotecan toxicity for this prediction system were 
8.0 (95% CI, 1.5-42.5) and 16.3 (95% CI, 2.2-121.4) in training 
and validation populations, respectively (p<0.05, Table VI). 

Patients with either of three UGTIA alleles [UGT1A1*6, 
UGT1A7 (622T>C) or UGTlA9*lh], UGTIA haplotype-I or 
haplotype-II showed significant association to severe irino- 
tecan toxicity (p<0.05) in both the training and validation 
populations (data not shown). 
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Training (n=73) 


Vahdation (n= 


=50) 


n 


(%) 


n 


(%) 


Matched with the combination^ 


62/73 


(84.9) 


43/50 


(86.0) 


Accuracy in applied patients 


52/62 


(83.9) 


31/43 


(72.1) 


Positive predictive value* 


ly/zz 


(oo.4) 


7/10 


(70.0) 


Negative predictive value'' 


33/40 


(82.5) 


24/33 


(72.7) 


Accuracy 


52/73 


(71.2) 


31/50 


(62.0) 


Sensitivity 


19/34 


(55.9) 


7/17 

//If 




Specificity 


33/39 


(84.6) 


24/33 


(72.7) 



The combination consists of 8 factors; 6 genotypes [UGT1A1*6, UGT1A1*28, UGT1A1*60, UGT1A7 (387T>G), UGT1A7 (622T>C) and 
UGTlA9*lb] , gender and age. ''Prediction of severe toxicity is positive and prediction of no severe toxicity is negative. 



Discussion 

The novel system for predicting severe irinotecan toxicity 
described here was based on genotypes at 6 polymorphic sites 
in UGTIA and 2 basic clinical features; notably, it showed high 
predictive performance even though the treatment regimens 
differed among the training and validation patients (Tables V 
and VI). The odds ratio of positive prediction for severe irino- 
tecan toxicity was higher for this prediction system than for that 
of any other haplotype or for that of any genotype (Table VI). 
The performance of this prediction system was reduced from 
the 83.9% accuracy seen with applied patients to this system 
in the training population to 72.1% accuracy in the validation. 
With regard to positive prediction, the inconsistency in accuracy 
between training and validation populations was seen when the 
combinations included the UGTlA9*lb site and patient age (P-II, 
VI and VII in Fig. 2). The frequencies of UGTlA9*lb geno- 
type differed between the training and validation populations; 
moreover, the UGTlA9*lb alleles were not in Hardy-Weinberg 
equilibrium in the validation population (data not shown). The 
cutoff value for patient age (60 years old) was determined by a 
ROC curve generated with data from the training population; 
however, previous studies used a cutoff age of 65 years (20,21). 
Indeed, one patient without toxicity, but predicted as presence of 
toxicity in this system, was aged 63 years. 

Some genotypic combinations decreased the performance 
of negative prediction for sever irinotecan toxicity in the vali- 
dation population relative to the training population (N-Il, IV, 
and V in Fig. 2). Specifically, 36.4% (n=4/ll) of patients in 
training population with a combined genotype that included 
heterozygous for UGT1A1*28 alleles and UGT1A1*6 (-/-) 
experienced severe irinotecan toxicity, but 66.7% (n=4/6) of 
the patients in validation population with the same genotype 
combinations (UGT1A1*6, -I- and UGT1A1*28, -/+) showed 
severe toxicity. Of the 73 patients in the training population and 
the 50 in the validation population, 11 (15.1%) and 7 (14.0%), 
respectively, were matched with neither of the combination in 
our prediction system. Interestingly, the incidence of severe 
toxicity among patients who were not matched with either 
combination identified by this prediction system was 72.7% 



(training population) and 14.3% (validation population) 
(Table VI). Therefore, the frequency of the irinotecan toxicity 
among patients who do not have any combination of UGTIA 
variants identified by this novel prediction system might be 
due to factors other than UGTIA polymorphisms. 

Many published studies have focused on associations 
between irinotecan toxicity, irinotecan efficacy, or both and 
any one or more of each UGTIA variants examined here 
(10-19,31,32). Patients, especially Asian patients, homozygous 
for UGT1A1*6 or *28 or compound heterozygous for these 
variants are at high risk for hematologic toxicity (13,33,34). In 
this study, each patient homozygous for UGT1A1*6 (n=3) and 
those compound heterozygous for UGTIAI'^'6 and *2S (n=3) 
showed severe hematologic toxicity; however, 45 patients of 
the remaining 117 patients still exhibited severe irinotecan 
toxicity. UGT1A1*6 and *2(S each have strong effects on 
UGTlAl activity and expression, but frequency of each allele 
is low; moreover, the frequencies of each allele differ between 
races (11,14,35-37). Among the patients that lacked these rare, 
highly effective variants, this novel prediction system could 
accurately predict whether there is severe irinotecan toxicity. 

Here, as in previous studies, each identified UGTIA 
haplotypes was useful for precisely predicting the presence or 
absence of severe irinotecan toxicity (14,18,38-40). Consistent 
with our study, Cecchin et al reported that a haplotype 
comprising UGT1A1*28 (-), UGT1A1*60 (-), UGT1A7 (3877 
and 622T), and UGTlA9*lb {+) was a predictor of severe 
hematologic toxicity during the entire course of therapy (18). 
However, determining the haplotypes for any one patient is 
a difficult clinical measurement. Therefore, the genotypes at 
each of the 6 sites (rather than the haplotype or diplotype) 
could be used for clinical assessments. 

Our prediction system depend not only on UGTIA geno- 
types but also on patient gender and age. Previous studies 
showed that patient gender and age were related to the risk 
of irinotecan toxicity (20-22). In the training population, 
patient age was associated with severe irinotecan toxicity, but 
patient gender was not (Table IV). Interestingly, when patient 
age, patient gender or both the patient age and gender were 
excluded from the factors used by the prediction system, the 
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number of patients that matched with the prediction system 
decreased, although the system maintained the high positive 
and negative predictive values (data not shown). 

The SFFS algorithm could be modified to include other 
factors (e.g., mutations in the tumor, patients' clinical character- 
istics, additional genetic variants, etc.) to improve the prediction 
performance. Such modifications may result in a system that 
could meaningfully predict clinical outcomes, including tumor 
response. Recent advances in technology for sequencing whole 
genomes of individuals may lead to substantial increases in 
information that might be useful for personalized therapy. 
However, such complicated information could not be efficiently 
or fully utilized in the currently available formats. SFFS could 
easily construct a system that can utilize huge data sets such as 
whole-genome sequences. Our strategy for developing SFFS- 
based systems for clinical use could serve as a powerful tool for 
advancing personalized therapy, although additional prospec- 
tive study of this prediction system is needed. 
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